A study of two-formant models for vowel identification

نویسندگان

  • Kuldip K. Paliwal
  • William A. Ainsworth
  • D. Lindsay
چکیده

An experiment has been performed where various two-formant models reported in the literature were assessed as to their ability to predict the formant frequencies obtained in a vowel identification task. An alternative model is proposed in which the auditory processing of vowel sounds is assumed to take place in two stages: a peripheral processing stage and a central processing stage. In the peripheral stage the speech spectrum is transformed to its auditory equivalent and the formant frequencies are extracted from this spectrum using a peak-picking mechanism. The central stage performs a two-formant approximation on the results of the first stage operation, and it is this formant pair that vowel identification is taken to operate on during vowel perception. The first and second formant frequencies of this two-formant model are taken to be equal to the first and second formant frequencies extracted at the first stage plus a perturbation term which accounts for the interaction effects of the neighbouring formants. The perturbation caused by each of these neighbouring formants is inversely proportional to its separation from the main formants. This model compares favourably with previous models in its prediction of the formant frequencies obtained from the vowel identification task. Zusammenfassung. In einem Experiment geht es zun~ichst um die Leistung verschiedener aus der Literatur bekannter Zwei-Formanten-Modelle, in einem ldentif;kationstest mit Vokalen erhaltene Formantfrequenzen voraussagen zu k6nnen. Danach wird ein zweistufiges Aiternativmodeli der auditiven Verarbeitung yon Vokalen mit periph~er und zentraler Stufe entwickelt. Auf der periph~en Stufe, wo das Spektrum des Sprachschalls in seinen entsprechenden Geh6rseindruck 0berftihrt wird, erfolgt die Formantfrequenzextraktion aus den Spektren nach dem Prinzip des "peak-picking". Die ERgebnisse dieser Operation liefern der zentralen Stufe die Grundlage einer Zwei-Formanten-Approximation. Auf dieses Formantenpaar diirfte sich die Vokalidentifikation w~ihrend der Vokalperzeption stiitzen. Die Frequenzen des 1. und 2. Formanten unseres Zwei-Formanten-Modells ergeben sich aus den entsprechenden Formantfrequenzen der ersten Stufe unter zusiitzlicher Beriicksichtingung eines Korrekturausdrucks, der den Einfluss tier Nachbarformanten einf~ingt. Der Einfluss jedes der Nachbarformanten ist seinem Abstand zu den Hauptformanten umgekehrt proportional. Gegentiber friiheren Vorschlagen hat unser Modell den Vorzug, dass sich mit ibm die Formantfrequenzen aus dem Identifikationsexperiment besser voraussagen

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study and Comparison of Formant Characteristics of Persian Vowels in 4-7-year-old Children Using Cochlear Implants and Those Using Hearing Aids

Background and Objective: One of the most important physical properties of vowels is their formant structure. One of the most obvious speech errors in hearing-impaired children is vowel errors. The present study aimed to determine and compare the formant structure of Persian vowels in deaf and cochlear implant children in the age range of 4-7 years. Materials and Methods: This descriptive-anal...

متن کامل

Production of English Lexical Stress by Persian EFL Learners

This study examines the phonetic properties of lexical stress in English produced by Persian speakers learning English as a foreign language. The four most reliable phonetic correlates of English lexical stress, namely fundamental frequency, duration, intensity, and vowel quality were measured across Persian speakers’ production of the stressed and unstressed syllables of five English disyllabi...

متن کامل

Acoustic Analysis of Persian EFL Learners' Pronunciation of English Vowels

This paper reports the results of an experimental study on non-native production of English vowels. Two groups of Persian EFL learners varying in language proficiency were tested on their ability to produce the nine plain vowels of American English. Vowel production accuracy was assessed by means of acoustic measurements. Ladefoged and Maddison’s (1996) F1 F2 measurements for American English v...

متن کامل

The Study of Vowel Space and Formant Structure in Mazani Language

Objective: One of the parameters showing the correct phonetic and phonological development is the correct and clear articulation of vowels is achieved by changing the shape of vocal cords through altering the height and position of the tongue and the movement of the lips and jaw. The tongue’s height and position are the basis of the production and difference of vowels. In other words, the raw s...

متن کامل

Perceptual separation of simultaneous vowels: within and across-formant grouping by F0.

Six experiments explored why the identification of the two members of a pair of diotic, simultaneous, steady-state vowels improves with a difference in fundamental frequency (delta F0). Experiment 1 confirmed earlier reports that a delta F0 improves identification of 200-ms but not 50-ms duration "double vowels"; identification improves up to 1 semitone delta F0 and then asymptotes. In such sti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Speech Communication

دوره 2  شماره 

صفحات  -

تاریخ انتشار 1983